Emerging practices for mapping and linking life sciences data using RDF - A case series

نویسندگان

  • M. Scott Marshall
  • Richard D. Boyce
  • Helena F. Deus
  • Jun Zhao
  • Egon L. Willighagen
  • Matthias Samwald
  • Elgar Pichler
  • Janos Hajagos
  • Eric Prud'hommeaux
  • Susie Stephens
چکیده

Members of the W3C Health Care and Life Sciences Interest Group (HCLS IG) have published a variety of genomic and drugrelated datasets as Resource Description Framework (RDF) triples. This experience has helped the interest group define a general data workflow for mapping health care and life science (HCLS) data to RDF and linking it with other Linked Data sources. This paper presents the workflow along with four case studies that demonstrate both the workflow and many of the challenges that may be faced when using the workflow to create new Linked Data resources. The first case study describes the creation of linked RDF data from microarray datasets while the second discusses a linked RDF dataset created from a knowledge base of drug therapies and drug targets. The third case study describes the creation of an RDF index of biomedical concepts present in unstructured clinical reports and how this index was linked to a drug side-effect knowledge base. The final case study describes the initial development of a linked dataset from a knowledge base of small molecules. This paper also provides a detailed set of recommended practices for creating and publishing Linked Data sources in the HCLS domain in such a way that they are discoverable and useable by users, software agents, and applications. These practices are based on the cumulative experience of the Linked Open Drug Data (LODD) task force of the HCLS IG. While no single set of recommendations can address all of the heterogeneous information needs that exist within the HCLS domains, practitioners wishing to create Linked Data should find the recommendations useful for identifying the tools, techniques, and practices employed by earlier developers. In addition to clarifying available methods for producing Linked Data, the recommendations for metadata should also make the discovery and consumption of Linked Data easier.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dealing with the Messiness of the Web of Data

Research on the Semantic Web, which is now in its second decade, has been successful in encouraging people to publish data on the Web in structured, linked, and standardized ways. The success of what has now become the Web of Data can be seen from the sheer number of triples available within the Linked Open Data, Linked Life Data and Open Government initiatives. However, this growth in data mak...

متن کامل

A New Model Representation for Road Mapping in Emerging Sciences: A Case Study on Roadmap of Quantum Computing

One of the solutions for organizations to succeed in highly competitive markets is to move toward emerging sciences. These areas provide many opportunities, but, if organizations do not meet requirements of emerging sciences, they may fail and eventually, may enter a crisis. In this matter, one of the important requirements is to develop suitable roadmaps in variety fields such as strategic, ca...

متن کامل

Evaluation of Corporate Governance Practices in Emerging Markets (A case study of Nigerian Banking Industry)

This study explores corporate governance practices within the context of the Nigerian banking industry using instances of corporate governance lapses that resulted in part to the Nigerian banking crises. We present multiple case analysis of publicly available documents and court papers (in the United Kingdom and Nigeria) to document instances of breach and areas of weakness in the existing Nige...

متن کامل

YeastHub: a semantic web use case for integrating data in the life sciences domain

MOTIVATION As the semantic web technology is maturing and the need for life sciences data integration over the web is growing, it is important to explore how data integration needs can be addressed by the semantic web. The main problem that we face in data integration is a lack of widely-accepted standards for expressing the syntax and semantics of the data. We address this problem by exploring...

متن کامل

The EBI RDF platform: linked open data for the life sciences

MOTIVATION Resource description framework (RDF) is an emerging technology for describing, publishing and linking life science data. As a major provider of bioinformatics data and services, the European Bioinformatics Institute (EBI) is committed to making data readily accessible to the community in ways that meet existing demand. The EBI RDF platform has been developed to meet an increasing dem...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • J. Web Sem.

دوره 14  شماره 

صفحات  -

تاریخ انتشار 2012